Quicker ADC : Unlocking the Hidden Potential of Product Quantization With SIMD

نویسندگان

چکیده

Efficient Nearest Neighbor (NN) search in high-dimensional spaces is a foundation of many multimedia retrieval systems. A common approach to rely on Product Quantization, which allows the storage large vector databases memory and efficient distance computations. Yet, implementations nearest neighbor with Quantization have their performance limited by accesses they perform. Following this observation, Andr\'e et al. proposed Quick ADC up $6\times$ faster $m\times{}4$ product quantizers (PQ) leveraging specific SIMD instructions. Quicker generalization not codes supporting AVX-512, latest revision instruction set. In doing so, faces challenge using efficiently 5,6 7-bit shuffles that do align computer bytes or words. To end, we introduce (i) irregular combining sub-quantizers different granularity (ii) split tables allowing lookup larger than registers. We evaluate multiple indexes including Inverted Multi-Indexes IVF HNSW show it outperforms reference optimized (i.e., FAISS polysemous codes) for numerous configurations. Finally, release an open-source fork enhanced at http://github.com/nlescoua/faiss-quickeradc.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Unlocking hidden genomic sequence.

Despite the success of conventional Sanger sequencing, significant regions of many genomes still present major obstacles to sequencing. Here we propose a novel approach with the potential to alleviate a wide range of sequencing difficulties. The technique involves extracting target DNA sequence from variants generated by introduction of random mutations. The introduction of mutations does not d...

متن کامل

Unlocking the Hidden Information in Starlight

A provocative new result [1] by Mankei Tsang, Ranjith Nair, and Xiao-Ming Lu of the National University of Singapore suggests that a long-standing limitation to the precision of astronomical imaging, the Rayleigh criterion, proposed in 1879 [2] is itself only an apparition. Using quantum metrology techniques, the researchers have shown that two uncorrelated point-like light sources, such as sta...

متن کامل

Unlocking the Potential of Simulators: Design with RL in Mind

Using Reinforcement Learning (RL) in simulation to construct policies useful in real life is challenging. This is often attributed to the sequential decision making aspect: inaccuracies in simulation accumulate over multiple steps, hence the simulated trajectories diverge from what would happen in reality. In our work we show the need to consider another important aspect: the mismatch in simula...

متن کامل

Unlocking the Potential of Cell Phones

ISSUES The rapid rise of mobile phone use in poor countries is well known as an exemplary case of a technology enabling bottom-up empowerment through information access, driven by smallmargin business and end-user innovation. While many are not mobile phone owners themselves, few today face a several mile walk to access an often-disconnected landline phone for communication, which was a regular...

متن کامل

Unlocking the potential of supported liquid phase catalysts with supercritical fluids: low temperature continuous flow catalysis with integrated product separation

Solution-phase catalysis using molecular transition metal complexes is an extremely powerful tool for chemical synthesis and a key technology for sustainable manufacturing. However, as the reaction complexity and thermal sensitivity of the catalytic system increase, engineering challenges associated with product separation and catalyst recovery can override the value of the product. This persis...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Pattern Analysis and Machine Intelligence

سال: 2021

ISSN: ['1939-3539', '2160-9292', '0162-8828']

DOI: https://doi.org/10.1109/tpami.2019.2952606